Trainability and Accuracy of Artificial Neural Networks: An Interacting Particle System Approach

Authors

Abstract

Neural networks, a central tool in machine learning, have demonstrated remarkable, high fidelity performance on image recognition and classification tasks. These successes evince an ability to accurately represent high dimensional functions, but rigorous results about the approximation error of neural networks after training are few. Here we establish conditions for global convergence of the standard optimization algorithm used in machine learning applications, stochastic gradient descent (SGD), and quantify the scaling of its error with the size of the network. This is done by reinterpreting SGD as the evolution of a particle system with interactions governed by a potential related to the objective or "loss" function used to train the network. We show that, when the number $n$ of units is large, the empirical distribution of the particles descends on a convex landscape towards the global minimum at a rate independent of $n$, with a resulting approximation error that universally scales as $O(n^{-1})$. These properties are established in the form of a Law of Large Numbers and a Central Limit Theorem for the empirical distribution. Our analysis also quantifies the scale and nature of the noise introduced by SGD and provides guidelines for the step size and batch size to use when training a neural network. We illustrate our findings on examples in which we train neural networks to learn the energy function of the continuous 3-spin model on the sphere. The approximation error scales as our analysis predicts in dimension as high as $d=25$.
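The particle picture in the abstract can be illustrated with a minimal sketch. It assumes a two-layer network with the output averaged over $n$ units, so that each unit's parameters $(c_i, z_i)$ play the role of one particle. The tanh unit, the single-unit teacher target, the step multiplied by $n$, and the names `features`, `predict`, `sgd_step`, and `train` are all assumptions made for illustration; none of them is taken from the paper.

```python
import numpy as np

def features(x, z):
    # Hypothetical unit: phi(x, z_i) = tanh(x . z_i); x is (m, d), z is (n, d).
    return np.tanh(x @ z.T)  # shape (m, n)

def predict(x, c, z):
    # Mean-field scaling (assumed): the output is an average over the n particles.
    return features(x, z) @ c / c.shape[0]

def loss(x, y, c, z):
    # Quadratic loss averaged over the samples in (x, y).
    return 0.5 * np.mean((predict(x, c, z) - y) ** 2)

def sgd_step(x, y, c, z, lr):
    # One SGD step on a mini-batch (x, y).
    m, n = len(y), c.shape[0]
    phi = features(x, z)                 # (m, n)
    resid = phi @ c / n - y              # (m,)
    grad_c = phi.T @ resid / (m * n)     # d loss / d c_i
    grad_z = ((resid[:, None] * (1 - phi ** 2)) * c[None, :]).T @ x / (m * n)
    # The step is multiplied by n (an assumption) so that each particle
    # moves at a rate that does not shrink as n grows.
    return c - lr * n * grad_c, z - lr * n * grad_z

def train(n=50, d=3, steps=300, batch=64, lr=0.1, seed=0):
    rng = np.random.default_rng(seed)
    w = rng.normal(size=d)               # hypothetical teacher direction
    target = lambda x: np.tanh(x @ w)
    c, z = rng.normal(size=n), rng.normal(size=(n, d))
    x_test = rng.normal(size=(2000, d))
    y_test = target(x_test)
    initial = loss(x_test, y_test, c, z)
    for _ in range(steps):
        x = rng.normal(size=(batch, d))  # fresh mini-batch each step
        c, z = sgd_step(x, target(x), c, z, lr)
    return initial, loss(x_test, y_test, c, z)
```

Calling `train()` returns the held-out loss before and after training. Repeating the call for several values of `n` is one way to probe empirically how the error depends on the number of units, although this toy setup is not the 3-spin example from the paper.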

Similar resources

Trainability in Recurrent Neural Networks

Two potential bottlenecks on the expressiveness of recurrent neural networks (RNNs) are their ability to store information about the task in their parameters, and to store information about the input history in their units. We show experimentally that all common RNN architectures achieve nearly the same per-task and per-unit capacity bounds with careful training, for a variety of tasks and stac...

An Approach of Artificial Neural Networks Modeling Based on Fuzzy Regression for Forecasting Purposes

In this paper, a new modeling approach for Artificial Neural Networks (ANNs) based on the concepts of fuzzy regression is proposed. For this purpose, we reformulate the ANN model as a fuzzy nonlinear regression model that retains the advantages of both fuzzy regression and ANN models. Hence, it can be applied to uncertain, ambiguous, or complex environments due to its flexibility for forecas...

Prediction the Return Fluctuations with Artificial Neural Networks' Approach

Variation of returns over time, the inefficiency of the studies performed so far, and the presence of factors affecting the share return rate have driven the development of modern, intelligent methods for estimating and evaluating share returns in listed companies. The aim of this research is to predict returns from financial variables using an artificial neural network approach. Therefore, the statistical population of this study incl...

Capacity and Trainability in Recurrent Neural Networks

Two potential bottlenecks on the expressiveness of recurrent neural networks (RNNs) are their ability to store information about the task in their parameters, and to store information about the input history in their units. We show experimentally that all common RNN architectures achieve nearly the same per-task and per-unit capacity bounds with careful training, for a variety of tasks and stac...

scour modeling piles of kambuzia industrial city bridge using hec-ras and artificial neural network

Today, scouring is one of the important topics in river and coastal engineering, since most bridge destruction is caused by this phenomenon. Because bridges are among the most important connecting structures in the country's road network, and their importance doubles during floods, their exact design and maintenance is crucial. f...

Journal

Journal title: Communications on Pure and Applied Mathematics

Year: 2022

ISSN: 1097-0312, 0010-3640

DOI: https://doi.org/10.1002/cpa.22074